SMOR: A German Computational Morphology Covering Derivation, Composition and Inflection
نویسندگان
چکیده
We present a morphological analyser for German inflection and word formation implemented in finite state technology. Unlike purely lexicon-based approaches, it can account for productive word formation like derivation and composition. The implementation is based on the Stuttgart Finite State Transducer Tools (SFST-Tools), a non-commercial FST platform. It is fast and achieves a high coverage.
منابع مشابه
Umlaut and Inflection in German
The present paper examines the prosodic constituent Foot as the domain of phonological phenomena in German. Several processes take place in this constituent, such as Glottal Stop Insertion and Final Devoicing, as well as the phenomena that are described below: productive umlaut and infinitive inflection. The status of the trochaic Foot as the unmarked constituent in German is also discussed. Th...
متن کاملMorphological Generation of German for SMT
We participated in the ACL WMT 2009 shared task for translation of German to English, and English to German. We used the Moses open source system, combined with morphological processing. For German to English, we had the only constraint system comparable with the open-data systems. One of the reasons the system performed well was strong reduction of the German vocabulary, through a simplistic c...
متن کاملZmorge: A German Morphological Lexicon Extracted from Wiktionary
We describe a method to automatically extract a German lexicon from Wiktionary that is compatible with the finite-state morphological grammar SMOR. The main advantage of the resulting lexicon over existing lexica for SMOR is that it is open and permissively licensed. A recall-oriented evaluation shows that a morphological analyser built with our lexicon has comparable coverage compared to exist...
متن کاملLearning Morphology of Romance, Germanic and Slavic Languages with the Tool Linguistica
In this paper we present preliminary work conducted on semi-automatic induction of inflectional paradigms from non annotated corpora using the open-source tool Linguistica (Goldsmith 2001) that can be utilized without any prior knowledge of the language. The aim is to induce morphology information from corpora such as to compare languages and foresee the difficulty to develop morphosyntactic le...
متن کاملSyncretism without Underspecification: The Role of Leading Forms
The main goal of this article is to outline a new approach to syncretism in optimality theory, one that does not rely on the concept of underspecification taken over from grammatical theories which do not recognize constraint ranking and constraint violability. The analysis is based on a concept of morphological exponents as leading forms. Instances of syncretism can be traced back to the selec...
متن کامل